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CORRELATION  AFTER  ASYMMETRICAL  CLIPPING 
ABSTRACT 

1.  A  general  formula  is  derived  for  the  correlation  coefficient  between 
clipped  waveforms  or  among  detection  sequences,  for  the  case  where  the 
clipping  is  asymmetric  or  the  detection  probability  departs  from  50%. 

The  analytic  arcsine  law  for  symmetrical  clipping  is  rehearsed  and  new 
analytic  forms  are  found  for  asymmetrical  clipping  with  high  positive 
correlation,  numerically  low  correlation  and  high  negative  correlation. 


Introduction 


2.  This  paper  is  about  hard  clipping,  where  a  waveform  or  data  stream 
is  clipped  about  some  level  and  converted  into  a  binary  sequence.  The 
auto-correlation  function  of  this  sequence  is  considered,  and  in 
particular  its  relation  to  the  correlation  function  of  the  original 
signal.  For  symmetrical  clipping,  ie  clipping  about  the  mean  level,  the 
well-known  arcsine  law  was  originally  derived  by  Van  Vleck  in  1943 

(Ref  1),  and  in  acknowledging  this  it  seems  unnecessary  to  distinguish 
between  auto-correlation  and  cross-correlation.  Other  sources  are 
Refs  2  and  3.  Here  we  extend  the  calculation  to  clipping  at  arbitrary 
levels,  where  it  is  believed  the  result  has  not  been  published  before. 
The  calculation  is  quite  straightforward  but  the  answer  has  some 
unexpected  features,  and  is  also  one  that  is  needed. 

3.  There  could  be  some  interest  in  asymmetrical  clipping  in  signal 
processing  itself.  But  the  real  interest  arises  from  the  important 
analogy  between  target  detection  theory  and  data  processing  on  the  one 
hand,  and  clipped  signal  processing  on  the  other  hand.  Thus  sonar  and 
radar  detection  decisions  are  commonly  or  effectively  based  on  whether 
or  not  a  threshold  is  crossed.  The  precise  proportion  of  signals 
crossing  the  threshold  cannot  be  controlled,  so  we  cannot  easily  arrange 
a  50%  probability  P  corresponding  to  the  arcsine  law.  In  practice 
detection  is  often  a  cumulative  process  with  a  low  probability  P  for  any 
given  signal  and  a  wide  departure  from  the  arcsine  law.  The  present 


paper  stands  on  its  own  but  also  serves  as  a  companion  or  back-up  to 
Ref  4,  which  uses  the  asymmetrical  clipping  relations. 


4. 


4.  Here  we  first  set  the  scene  by  recapitulating  in  section  I  on 
some  descriptors  for  the  scatter  diagram,  as  a  lead  in  to  a  particu¬ 
larly  simple  derivation  of  the  arcsine  law  in  section  II.  Section  III 
is  the  main  one,  presenting  the  general  correlation  answers.  Sections 
IV  -  VI  derive  analytic  forms  which  apply  in  various  limiting  condi¬ 
tions  . 

I  CORRELATION  AND  THE  SCATTER  DIAGRAM 

5.  Let  x  and  y  be  random  variates  of  zero  mean,  normalised  if  necessary 
in  order  to  have  the  same  standard  deviation.  Figure  1  represents  the 
scatter  diagram  or  joint  probability  density  function.  The  line  y  *  x 

is  shown  as  the  major  axis,  with  half  axis  length 


J  x2+y2+2xy  -  / 


2x  (l+p) 


The  line  y  *  -x  or  minor  axis  has  half  length 


(1) 


J  (y-x)2  -  J  x2+y2-2xy  -  J 2x2(l-p). 

Thus  the  ratio  of  the  axes  can  be  written  in  terms  of  correlation 
coefficient  p, 


(2) 


E 

equivalent  to 


P 


J 


1  -  P 
1  +  P 


l 

1 


(3) 


(4) 


The  asymptotic  form  for  |p|  small,  holding  up  quite  well  as  |p|  increases, 

is 


P+E  1.  .  (5) 

Using  these  relations  an  eye  fit  of  an  ellipse  to  a  scatter  diagram  can 
rapidly  give  an  approximate  value  for  the  correlation  coefficient. 


5. 


II  SYMMETRICAL  CLIPPING  AND  THE  ARCSINE  LAW 
6.  If  the  detection  decisions  or  clipper  outputs  x^  and  are 
represented  as  either  0  or  1,  symmetrical  clipping  will  be  about  a  mean 
level  0.5.  In  other  words  probability  of  threshold  crossing  P  is  50%. 
Let  us  shift  the  means  so  that 


x  •  y  -  0,  .  (6) 

*1  *  yl2  "  0,25  •  .  (7) 


Taking  these  numerical  values  is  useful  for  the  sake  of  being  specific, 
but  in  fact  they  cancel  out  in  equation  (8)  below.  There  will  be  a 
positive  correlator  output  (x^y^  *  0.25)  for  the  probability  masses  AA 
in  two  quadrants  of  figure  1,  with  a  negative  output  (x^y^  *  ~0.25)  for 
BB.  The  output  correlation  coefficient  after  clipping 

r  -  ^  •  g'25<**  -  *»>  .  (8) 

*1 


7.  To  calculate  B  it  is  usual  to  make  some  assumptions  about  the  dis¬ 
tribution  of  x  and  y,  in  fact  to  assume  they  are  both  normal,  and  then 
integrate.  Here  we  will  assume  instead  that  the  probability  contours  are 
all  elliptical,  as  discussed  further  below.  The  trick  now  is  to  distort 
the  scatter  diagram  by  expanding  it  in  the  (y *  -x)  direction  by  the 
factor  E  ,  so  that  the  ellipses  become  circles,  figure  2.  This  does  not 
change  the  relative  magnitude  of  A  and  B.  The  original  n/2  quadrant 
angle  for  B  now  becomes  2a  where 


tan  a 

giving  a 


(9) 

(10) 


6. 


8.  Since  we  have  central  symmetry  the  probability  associated  with 
a  sector  is  directly  proportional  to  sector  angle,  and  B  is  merely  a/ir. 
Equation  (8)  becomes 


r 

=  1 

4  . 

-  —  arcsin  y 

..  (11) 

~  (¥) 

2 

IT 

•  /  lie. 

arcsin  y  , 

..  (12) 

or  r 

2 

7T 

arcsin  p  - 

..  (13) 

9.  Equation  (13) 

is  the  usual  form 

of  the  so-called  arcsine  law. 

Together  with  the 

form 

(12)  it  can  be  used  to  derive  the  limiting 

forms  - 

P  >  r  «  1 

: 

r  (2/ir)p 

. 

..  (14) 

( 1-p  ) ,  ( 1-r )«  1 

:  (1 

-r)  ~  (2/ir) 

Jl  ( 1  -p~ )  . 

..  (15) 

10.  The  basis  of  this  derivation  is  that  the  original  contours  are 

elliptical.  It  also  seems  that  any  other  contour  shape  must  lead  to  a 

law  different  from  (13).  Now  the  elliptical  contours  come  from  the  product 

of  the  laws  of  probability  distribution  with  distance  d  along  the  (y  *  x) 

and  (y  ■  -x)  directions,  whereas  the  equation  for  an  ellipse  involves  the 

2 

weighted  sums  of  the  d  values.  These  points  can  only  be  reconciled  if 

2 

the  distribution  laws  have  an  exponential  with  d  terms,  ie  the  normal  or 
Gaussian  law.  The  sums  (y-fx)  and  (y-x)  therefore  need  to  be  Gaussian,  and 
since  suitable  combinations  of  the  (y+x)  and  (y-x)  laws  give  the  x  and  y 
laws  these  too  need  to  be  normal.  Thus  our  derivation  shows  that  the 
normality  assumption  is  implicit  in  the  arcsine  law  (13). 


Ill  ASYMMETRICAL  CLIPPING  AND  THE  GENERAL  QUADRANT  FORMULA 


11.  In  the  general  case  clipping  is  about  an  arbitrary  level.  Figure  3 
illustrates  this  for  positive  correlation  and  for  clipping  such  that 
P>0.5,  at  the  same  level  for  x  and  y.  We  keep  the  mean  values  x^  and  y^ 
zero  as  in  equation  (6)  and  find  this  leads  to 

Xl2  =  Yl2  =  (1_P)p2  +  p<1-p)2  s  p(l-p>  •  .  (16) 

12.  We  need  to  know  the  probability  masses  in  the  quadrants  B,  C  and  D 
defined  with  respect  to  the  clipping  levels.  Normal  probability  is 
assumed.  If  L  (x,  y,  p)  refers  to  the  integrated  joint  probability  in 
the  quadrant  where  the  x  and  y  values  exceed  the  stated  values  we  can 
use  the  rules  in  Ref  5  to  write 

B  ■  L  (x,  -x,  -p) 


-  L  (+)  -  L  (-)  .  (17) 

C  *  L  (x,  x,  p)  =  2L  (-)  .  (18) 

D  »  L  (-x,  -x,  p)  *  I  -  2L  (+)  .  (19) 


13.  The  multiplication  term  or  covariance  after  clipping 

-  2  2 

x^  -  CP  +  D(l-P)  -  2BP(1-P) 

-  2PL( -)  -  2( 1-P)  L(+)  +  (1-P)2  .  .  (20) 

14.  The  correlation  after  clipping  becomes 

r  .  Vi  .  iiili  .  2M+X  +  ill  .  .....  (21) 

— r  1-P  P  P 


8. 


15.  Sample  curves  are  plotted  in  figure  4  using  L  values  from  Ref  5. 

They  are  labelled  with  the  positive  x  levels  for  the  clipping,  but 
symmetry  means  that  we  get  the  same  curves  for  negative  values  of  x.  The 
P  values  or  percentage  clippings  then  become  50,  31,  16  and  7  -  perhaps 
of  more  practical  interest, 

16.  The  calculations  are  for  identical  x  and  y  clipping  levels,  and 
this  is  the  common  case,  encountered  for  example  in  auto-correlation. 
Calculations  with  different  levels  should  be  similar  and  straightforward. 
As  a  special  case  note  that  if  the  x  and  y  levels  are  equal  in  magnitude 
but  opposite  in  sign  we  merely  have  to  interchange  the  plots  in  the  two 
quadrants  in  figure  4. 

17.  The  behaviour  in  figure  4  is  surprisingly  complicated,  and  some  of 
the  limiting  forms  are  discussed  in  the  next  three  sections.  We  confirm 
now  that  for  clipping  at  x*0  and  P=0.5  we  do  revert  to  the  arcsine  law 
of  equation  (13). 

18.  The  wave  spectrum  is  of  course  the  Fcurier  transform  of  the  corre¬ 
lation  function,  and  we  remark  here  on  the  ordinary  spectrum  and  the 
auto-correlation  function.  It  may  be  remembered  that  symmetrical  clipping 
produces  a  relative  shift  towards  the  higher  spectral  frequencies.  We 
point  out  that  with  asymmetrical  clipping  this  shift  is  reinforced,  as 

a  consequence  of  the  behaviour  for  both  positive  and  negative  correla- 
t  ion. 

IV  ASYMMETRICAL  CLIPPIHG  WITH  HIGH  POSITIVE  CORRELATION 
18.  Figure  3  also  serves  to  illustrate  the  limiting  case  where  the 
correlation  is  very  high.  The  scatter  ellipse  then  becomes  very  narrow, 
and  the  probability  B  falls  off  from  its  value  in  the  symmetrical  case  as 
the  probability  density  function  Z  [  cf  Ref  3  p  493  ], 


9. 


B  =  Z  arcsin 


The  probability  masses  C  and  D  will  depend  on  the  percentage  detection, 
ie  the  probability  integral  P,  with  only  a  small  correction  for  B  - 


C  =  (1-P)  -  B  , 


D  =  P  -  B 


20.  The  multiplication  term  (compare  equation  (20)) 


=  P(l-P)  -  B  . 


21.  Taking  the  normalisation  from  equation  (16)  the  coefficient  after 
clipping  becomes 


Z  .  /1-P 

1  '  pTThpT  arcsin 


Remembering  that  (1-p)  and  (1-r)  are  small  gives 


(1-r)  = 


P(l-P)  J  * 


This  formula  is  consistent  with  the  general  result  (21)  and  with  the 
limiting  form  (15)  for  symmetrical  clipping. 

V  ASYMMETRICAL  CLIPPING  WITH  NUMERICALLY  LOW  CORRELATION 

22.  For  low  correlation  the  scatter  diagram  elliptical  contours  become 
nearly  circular.  To  deal  with  this  case  we  replace  the  scatter  diagram 
by  an  allied  form  having  very  narrow  elliptical  contours,  figure  5. 

We  define  three  new  independent  variables  abc  such  that 


2  2  2.2  2 

x  =  y  =  a  =  b  =  c 


10. 


Choose  x  and  y  so  that 

x  =  Sp  a+Jl-pb  ,  .  (29) 

y  =  Jp  a  +  J  1-p  c  .  .  (30) 


Here  p  is  introduced  as  an  arbitrary  constant  but  is  found  to  be  equal  to 
the  correlation  coefficient  between  x  and  y.  For  convenience  it  is 
treated  as  positive  in  the  discussion,  but  the  formulae  hold  whatever 
its  sign.  In  figure  5  the  diagonal  lines  correspond  to  fixed  values  of 
x,  and  therefore  to  possible  clipping  points. 

23.  Take  the  general  case  with  asymmetrical  clipping,  the  clipping  level 

being  defined  by  the  overall  probability  P.  Let  s/p"  a  have  the  small 

fixed  positive  value  6,  this  will  increase  the  chance  U  of  exceeding  the 

clipping  level  even  though  the  ,/1-p  b  component  is  still  allowed  to 

vary.  Since  6  is  small  the  distribution  function  Z  for  the  probability 

density  may  be  taken  as  the  value  corresponding  to  P,  and 

P  =  P  +  BZ  .  .  (31) 

(The  lack  of  a  linear  BZ  term  in  equation  (32)  below  means  that  terms  in 
2 

B  need  not  be  included  in  equation  (31)).  The  probability  that  both  x 

2 

and  y  will  exceed  the  clipping  level  (and  give  x^y^  =  (1-P)  )  is  there- 

2  2 
fore  p  ,  the  probability  that  neither  will  do  so  is  (1-p)  and  that  they 

will  have  opposite  signs  is  2p(l-p).  Thus  for  our  fixed  0 
x][y1  =  P2(1-P)2+(1-p)2P2-2p(1-p)P(1-P) 

2  2 

-  6  Z  .  .  (32) 

This  simple  result  brings  out  the  square  law  dependence  on  8,  any  non¬ 
zero  correlated  component  very  reasonably  leads  to  a  positive  correlation. 
We  now  take  the  average  for  all  values  of  8, 


11. 


Xlyl  =  P  a2  Z2  •  .  (33) 

24.  Using  equation  (16)  the  clipped  correlation  coefficient 

x,y,  2  2 

r-  =  ..A  A  =  JL2 — ,  .  n&) 

r  — 2  P(l-P)  ^  ' 

X1 

The  Z  and  P  effects  are  similar  to  those  in  equation  (27),  but  not 

identical.  If  we  now  assume  that  the  distribution  is  normal  the  ratio 

r/  may  be  calculated  as  a  function  of  P  using  standard  tables,  remember- 
~2 

ing  that  a  is  the  mean  variance.  Figure  6  shows  the  symmetry  in  the 
relation.  The  result  is  consistent  with  the  general  formula  (21),  and 
confirms  that  all  the  figure  4  lines  do  indeed  go  through  the  origin. 

2 

For  symmetrical  clipping  P  =  0.5  and  Z  =  (2ira  )  ,  so  that  the  ratio 

becomes  (2/ir)  in  agreement  with  equation  (14). 

VI  ASYMMETRICAL  CLIPPING  WITH  HIGH  NEGATIVE  CORRELATION 

25.  High  negative  correlation  is  illustrated  for  P>0.5  in  figure  7,  which 
allows  the  immediate  identification  of  the  probability  masses  as 

B  =  1-P  ,  (35) 

C  =  0  ,  (36) 

D  =  2P-1  .  (37) 


26.  We  find  for  the  limiting  forms 
x1y1  =  -  (l-P)2  , 

r  =  -  (1-P)/P  .  .  (39) 

If  P<0.5  the  roles  of  C  and  D  are  interchanged,  as  are  P  and  (1-P)  in  all 
these  equations.  Note  that  r  is  independent  of  p  in  all  these  limiting 
conditions.  Equation  (39)  is  consistent  with  the  general  formula  (21),  and 
helps  in  understanding  the  wide  spread  of  the  figure  4  lines. 


A 


12. 


VII  CONCLUSIONS 

27.  The  chief  result  is  the  general  equation  (21),  illustrated  by  the 
plot  in  figure  4,  with  surprisingly  complicated  behaviour.  There  is 
consistency  with  the  special  or  limiting  forms  for  symmetrical  clipping 
(13),  high  positive  correlation  (27),  numerically  low  correlation  (34) 
and  high  negative  correlation  (39). 
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